AITopics

Country:

Asia > China (0.46)
Africa (0.46)
North America > United States (0.28)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Information Technology > Services (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

Neural Information Processing SystemsApr-25-2026, 13:46:24 GMT

1c10d0c087c14689628124bbc8fa69f6-Supplemental-Conference.pdf

A.1 For LEHD model467 In Table 5, we explore the effects of eliminating normalization from the attention layer in our LEHD468 model. We train three LEHD models with the same training scheme and training budget, differing469 solely in the attention layer: one with batch normalization (BN), one with instance normalization470 (IN), and one without normalization (w/o). We train all three POMO models with the same reinforcement learning method477 with POMO strategy and training budget (1000 epochs). The results show that different types of478 normalization have few effects on the POMO model.479 The results in Table 6 show that removing normalization from attention layer has little impact on the480 model with a heavy encoder and a light decoder.

artificial intelligence, machine learning, node, (19 more...)

Genre: Research Report > New Finding (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.35)

Neural Information Processing SystemsFeb-11-2026, 04:52:10 GMT

cb7c403aa312160380010ee3dd4bfc53-Supplemental.pdf

graph, node, section 6, (16 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)

Neural Information Processing SystemsFeb-10-2026, 21:32:45 GMT

b922ede9c9eb9eabec1c1fecbdecb45d-Supplemental.pdf

curr, dataset, node, (16 more...)

Country:

Asia > China > Heilongjiang Province > Harbin (0.08)
Asia > China > Beijing > Beijing (0.06)
Asia > China > Sichuan Province > Chengdu (0.05)

Industry: Transportation (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

Neural Information Processing SystemsFeb-8-2026, 13:54:22 GMT

A Ablation study of normalization 466 A.1 For LEHD model

Instead, we can conclude that the underlying reason for the model's strong generalization In the original AM decoder, irrelevant nodes are masked during each construction step. Here is an extended explanation of Equation 2 in the case of TSP . The purpose of using this notation is to ensure solution alignment. By employing this notation, we can avoid such issues. Here is an extended explanation of Equation 2 in the case of CVRP .

artificial intelligence, machine learning, node, (18 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.48)

Tao, Nigel, Baxter, Jonathan, Weaver, Lex

A Multi-Agent, Policy-Gradient approach to Network Routing

arXiv.org Artificial IntelligenceDec-4-2025

Network routing is a distributed decision problem which naturally admits numerical performance measures, such as the average time for a packet to travel from source to destination. OLPOMDP, a policy-gradient reinforcement learning algorithm, was successfully applied to simulated network routing under a number of network models. Multiple distributed agents (routers) learned co-operative behavior without explicit inter-agent communication, and they avoided behavior which was individually desirable, but detrimental to the group's overall performance. Furthermore, shaping the reward signal by explicitly penalizing certain patterns of sub-optimal behavior was found to dramatically improve the convergence rate.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

2512.03211

Country: North America > United States (0.68)

Genre: Research Report (0.40)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

arXiv.org Artificial IntelligenceOct-14-2025

Self-Exploring Language Models for Explainable Link Forecasting on Temporal Graphs via Reinforcement Learning

Ding, Zifeng, Huang, Shenyang, Cao, Zeyu, Kondrup, Emma, Yang, Zachary, Huang, Xingyue, Sui, Yuan, Yuan, Zhangdie, Zhu, Yuqicheng, Hu, Xianglong, He, Yuan, Poursafaei, Farimah, Bronstein, Michael, Vlachos, Andreas

Forecasting future links is a central task in temporal graph (TG) reasoning, requiring models to leverage historical interactions to predict upcoming ones. Traditional neural approaches, such as temporal graph neural networks, achieve strong performance but lack explainability and cannot be applied to unseen graphs without retraining. Recent studies have begun to explore using large language models (LLMs) for graph reasoning, but most of them are constrained to static graphs or small synthetic TGs and lack the evaluation of the quality of reasoning traces generated by LLMs. In this work, we present Reasoning-Enhanced Learning for Temporal Graphs (ReaL-TG), a reinforcement learning framework that fine-tunes LLMs to perform explainable link forecasting on real-world TGs. ReaL-TG uses outcome-based reward to encourage models to self-explore reasoning strategies from graph structure and to produce explanations that directly justify their predictions. To enable evaluation on LLM-generated reasoning traces, we propose a new evaluation protocol combining ranking metrics with an LLM-as-a-Judge system that assesses both the quality of reasoning and the impact of hallucinations. Experiments with ReaL-TG-4B, obtained by fine-tuning Qwen3-4B under our framework, show that it outperforms much larger frontier LLMs, including GPT-5 mini, on ranking metrics, while producing high-quality explanations confirmed by both the LLM judge and human evaluation.

large language model, machine learning, node, (20 more...)

2509.00975

Country:

Asia (1.00)
North America > United States (0.46)
North America > Mexico (0.28)
Europe > Austria > Vienna (0.14)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceOct-14-2025

Graph Neural Network-Based Multicast Routing for On-Demand Streaming Services in 6G Networks

Wang, Xiucheng, Wang, Zien, Cheng, Nan, Xu, Wenchao, Quan, Wei, Shen, Xuemin

The increase of bandwidth-intensive applications in sixth-generation (6G) wireless networks, such as real-time volumetric streaming and multi-sensory extended reality, demands intelligent multicast routing solutions capable of delivering differentiated quality-of-service (QoS) at scale. Traditional shortest-path and multicast routing algorithms are either computationally prohibitive or structurally rigid, and they often fail to support heterogeneous user demands, leading to suboptimal resource utilization. Neural network-based approaches, while offering improved inference speed, typically lack topological generalization and scalability. To address these limitations, this paper presents a graph neural network (GNN)-based multicast routing framework that jointly minimizes total transmission cost and supports user-specific video quality requirements. The routing problem is formulated as a constrained minimum-flow optimization task, and a reinforcement learning algorithm is developed to sequentially construct efficient multicast trees by reusing paths and adapting to network dynamics. A graph attention network (GAT) is employed as the encoder to extract context-aware node embeddings, while a long short-term memory (LSTM) module models the sequential dependencies in routing decisions. Extensive simulations demonstrate that the proposed method closely approximates optimal dynamic programming-based solutions while significantly reducing computational complexity. The results also confirm strong generalization to large-scale and dynamic network topologies, highlighting the method's potential for real-time deployment in 6G multimedia delivery scenarios. Code is available at https://github.com/UNIC-Lab/GNN-Routing.

artificial intelligence, machine learning, node, (19 more...)

2510.11109

Country: Asia > China (0.46)

Genre: Research Report > New Finding (0.93)

Industry:

Information Technology (0.93)
Telecommunications > Networks (0.67)
Media > Television (0.50)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Abd-Elmagid, Mohamed A., Shi, Ming, Ekici, Eylem, Shroff, Ness B.

Online Learning for Optimizing AoI-Energy Tradeoff under Unknown Channel Statistics

arXiv.org Artificial IntelligenceSep-24-2025

We consider a real-time monitoring system where a source node (with energy limitations) aims to keep the information status at a destination node as fresh as possible by scheduling status update transmissions over a set of channels. The freshness of information at the destination node is measured in terms of the Age of Information (AoI) metric. In this setting, a natural tradeoff exists between the transmission cost (or equivalently, energy consumption) of the source and the achievable AoI performance at the destination. This tradeoff has been optimized in the existing literature under the assumption of having a complete knowledge of the channel statistics. In this work, we develop online learning-based algorithms with finite-time guarantees that optimize this tradeoff in the practical scenario where the channel statistics are unknown to the scheduler. In particular, when the channel statistics are known, the optimal scheduling policy is first proven to have a threshold-based structure with respect to the value of AoI (i.e., it is optimal to drop updates when the AoI value is below some threshold). This key insight was then utilized to develop the proposed learning algorithms that surprisingly achieve an order-optimal regret (i.e., $O(1)$) with respect to the time horizon length.

algorithm, artificial intelligence, machine learning, (16 more...)